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Abstract 



It is still unclear how an evolutionary algo- 
rithm (EA) searches a fitness landscape, and 
on what fitness landscapes a particular EA will 
do well. The validity of the building-block hy- 
pothesis, a major tenet of traditional genetic 
algorithm theory, remains controversial despite 
its continued use to justify claims about EAs. 
This paper outlines a research program to be- 
gin to answer some of these open questions, 
by extending the work done in the royal road 
project. The short-term goal is to find a simple 
class of functions which the simple genetic al- 
gorithm optimizes better than other optimiza- 
tion methods, such as hillclimbers. A dialec- 
tical heuristic for searching for such a class is 
introduced. As an example of using the heuris- 
tic, the simple genetic algorithm is compared 
with a set of hillclimbers on a simple subset of 
the hyperplane-defined functions, the pothole 
functions. 



1 BACKGROUND 

Evolutionary algorithms (EAs) are computational 
search methods based on biological evolution. Some 
common EAs are genetic algorithms (GAs) [12, 9, 18], 
evolutionary programming (EP) [6], evolution strategies 
(ESs) [24], genetic programming (CP) [17], and classifier 
systems [14]. The study of EAs is called evolutionary 
computation (EC). 

EAs are increasingly important in such areas as function 
optimization, machine learning, and modeling. How- 
ever, as Mitchell et al. emphasized in the royal road 
(RR) papers [19, 7, 20], it is still unclear how an EA 
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searches a fitness landscape, or even what an EA does. It 
is also unknown what types of problem are easy or hard 
for a particular EA, how various landscape features af- 
fect problem difficulty for an EA, or under what circum- 
stances an EA will outperform another search method. 
Even less work has been done to classify the features of 
real-world problems that may be relevant to EA perfor- 
mance. Moreover, the selection of EA parameters such 
as mutation rate or population size is still largely a black 
art, despite some promising research in this area. This 
lack of theory makes the selection and configuration of 
an EA for a given problem difficult. 

A major open question in EC is the function and im- 
portance of the crossover operator, which recombines 
two individuals. Holland [12] has argued that crossover 
is central to an EA's efficacy. The theoretical basis for 
this is the building-block hypothesis (BBH) [9], which 
states that an EA uses crossover to repeatedly com- 
bine compact subsolutions with high observed fitness 
from different individuals, forming more complete sub- 
solutions with even higher observed fitness. Such sub- 
solutions are called building blocks. When an EA uses 
crossover on symbol strings from the set A , where A 
is the set of possible symbols and I is the length of 
a string, the building blocks are short schemata with 
high observed fitness. Schemata are members of the 
set (A U {*}) f , where * is a wildcard symbol. They 
are hyperplanes in the search space. Holland's schema 
theorem [12] implies that short schemata with consis- 
tently above-average observed fitness tend to increase 
exponentially in frequency over several generations. (If 
operators other than reproduction are neglected, this is 
true for all partitions of the search space, not just for a 
partition into schemata. However, applying crossover to 
symbol strings induces a natural partition of the space 
into schemata [27]. Furthermore, short schemata are 
preserved by crossover. This makes schemata particu- 
larly relevant when studying EAs that use crossover on 
strings.) Applying crossover to individuals with high 
fitness is a plausible heuristic for generating offspring 
that will also be highly fit. The chance that this heuris- 
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tic succeeds can be quantified using Price's covariance 
and selection theorem [1]. Implicit in the BBH is also 
the hypothesis that there are many real-world problems 
amenable to solution by this process. 

A common misconception is that a schema has a unique, 
well-defined fitness, which is the average fitness of all of 
its possible instances, and that observed fitnesses are 
estimates of these "actual" values. In general, no such 
unique schema fitness exists, and the schema theorem 
makes no such assumption [11]: The observed fitness of 
a schema is the average fitness of its instances in the 
current population. This value depends on the distri- 
bution of schemata in the current population, which is 
biased by the EA over time. A uniform distribution is 
only seen immediately after generating an initial random 
population, if ever. Hence, there is no justification for 
arbitrarily defining a schema's fitness to be the average 
over a uniform distribution. A schema may be a highly- 
fit building block in one population but not in another, 
even under the same fitness function. Grefenstette [10] 
made essentially the same point when he criticized the 
"static building-block hypothesis" . 

The BBH is often used to explain how EAs work and 
to justify the importance of crossover. However, there 
is no theory that specifies in detail the conditions nec- 
essary for the BBH to be valid and thus for crossover to 
be beneficial. While there is empirical evidence in favor 
of the BBH [13], its validity in general for the SGA and 
for other EAs using crossover remains controversial [6], 
and the schema theorem's relevance to EA theory has 
been questioned as well [27, 28]. In particular, uniform 
crossover, which is more likely to break up short build- 
ing blocks than traditional crossover, is very effective on 
some problems [26], and it may be that on some prob- 
lems crossover acts as a macromutation operator, rather 
than as an operator that recombines building blocks [16]. 
More generally, it is not clear how to formulate a BBH 
that is valid for an arbitrary EA operating on an arbi- 
trary representation of solutions [22] . 

2 RESEARCH PROGRAM 

This paper presents a research program to extend the 
RR papers [19, 7, 20] in testing the validity of the 
BBH, focusing specifically on the simple genetic al- 
gorithm (SGA) [9]. The SGA is a GA that uses 
fitness-proportionate selection, single-point crossover, 
and point mutation to evolve a single panmictic pop- 
ulation of bit strings, with each generation completely 
replacing the previous one. I focus on the SGA because 
it is a relatively simple EA with a large theoretical lit- 
erature and because many EAs descend directly or in- 
directly from it. A theory developed for the SGA has a 
relatively good chance of being applicable to other EAs. 



As in the RR papers, I use function optimization to 
compare the SGA with other search algorithms. I do 
this because it is an increasingly important application 
for EAs, with relatively clear performance criteria, and 
because a simple fitness function is easy to design and 
implement. Also, function optimization can be viewed 
as search, so theories developed for it may be relevant 
to other applications of search, for instance artificial in- 
telligence [21] and evolutionary biology [30]. 

As De Jong [2] pointed out, the SGA is not a function 
optimizer, per se. But if the BBH is valid, the SGA 
should work particularly well as an optimizer on func- 
tions rich in building blocks that can be recombined 
to reach the optimum. Hence, to determine the valid- 
ity of the BBH, it would be useful to know the class 
of functions on which the SGA outperforms other opti- 
mizers. Examining what makes this class of functions 
particularly easy for the SGA will also help us to pre- 
dict which functions the SGA will perform well on. One 
step towards this goal is to find a simple class of func- 
tions on which the SGA outperforms hillclimbers. This 
was the goal of the RR papers and is also the immedi- 
ate goal here. To meet this goal, the SGA should con- 
sistently perform extremely well on the functions and 
outperform hillclimbers by a wide margin, for a reason- 
able set of performance criteria. (Note that this is a 
different question from that addressed by Wolpert and 
Macready's [29] no free lunch (NFL) theorem.) 

This paper takes a different approach from that of the 
RR papers, although the ultimate goal remains the 
same. In those papers, Forrest and Mitchell [7] deter- 
mined that hitchhiking was a major factor limiting SGA 
performance on the RR functions, causing it to perform 
worse than the random mutation hillclimber (RMHC). 
Hitchhiking occurs when detrimental or neutral alleles 
increase in frequency due to the presence of nearby ben- 
eficial alleles on the same chromosome [8]. This can 
cause beneficial alleles at the same loci as the hitchhik- 
ing alleles to die out in the population, preventing the 
SGA from finding any highly-fit individuals that con- 
tain those alleles. (The fact that schemata do not have 
unique, well defined fitnesses is a necessary precondi- 
tion for hitchhiking.) After identifying this problem, 
Mitchell ct al. [20] investigated how to make the SGA 
perform more like an idealized genetic algorithm that 
was unaffected by hitchhiking. They developed the RR 
function i?4, which reduced hitchhiking by lowering the 
fitness jump from one level of the function to the next. 
In effect, they made the SGA outperform the RMHC by 
making the functions easier for the SGA. In contrast, I 
attempt to make them harder for hillclimbers. The RR 
functions are easy for hillclimbers like the RMHC be- 
cause they are convex: An algorithm never needs to go 
downhill in order to reach the global optimum. To make 
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these functions hard for hillclimbers, I add potholes to 
them: valleys in the fitness landscape that block a hill- 
climber's path to the optimum [19]. This produces the 
pothole functions, described in Section 3. (Holland pro- 
posed a class of RR functions, described by Jones [15], 
that also contained potholes.) 

Pothole functions are a very simple subset of Holland's 
hyperplane-defined functions (HDFs) [13]; potholes are 
examples of HDF refinements. A long-term goal is to 
design a class of pothole functions with parameters that 
can be varied to select the landscape features present 
in a function, as well as its overall difficulty. An arbi- 
trary number of functions could then be generated with 
the desired characteristics by using those parameters to 
define probability distributions, which in turn could be 
used to choose the schemata that contribute to an in- 
dividual's fitness, along with their fitness contributions. 
(In this paper, these will be called a function's signifi- 
cant schemata.) Such a class would allow a researcher 
to use statistical methods to calculate the certainty of 
statements about an algorithm's performance across the 
entire class, while being easier to understand than more 
general classes of HDFs. However, it is not yet clear 
what parameters or distributions should be used, if the 
goal is to describe a class of functions that is easy for 
the SGA yet hard for hillclimbers. The immediate goal 
is to use hand-designed pothole functions as testbeds to 
determine what regions of distribution space should be 
used for randomly-generated functions. 

I base my work on the RR functions because they were 
explicitly designed to investigate the validity of the BBH 
by studying the SGA's performance on functions rich in 
building blocks. The significant schemata in a RR func- 
tion are not building blocks in every population, since 
their fitness depends on the current population. How- 
ever, the functions arc defined so that they are build- 
ing blocks in all contexts except in the occurrence of 
hitchhiking, since they make only positive fitness con- 
tributions. The functions are "rich in building blocks" 
in this sense; the pothole function pi described in Sec- 
tion 3 has the same characteristic. In this paper, a 
schema that makes a positive fitness contribution (ig- 
noring the fitness contribution of other schemata) will 
be called a beneficial schema. (It is possible to define 
a building block as any beneficial schema, in contrast 
to the definition given earlier. This definition is related 
to Fisher's [5] average excess, but it makes the rela- 
tionship between the BBH and the schema theorem less 
clear [J. H. Holland, personal communication].) Like 
the RR functions, the pothole functions are not meant 
to be realistic. Since the fitness contribution of every 
schema is specified in advance, schemata can be used 
as tracers: They can be related to individual landscape 
features, and their frequency in the EA population can 



1. Create a function that is easy for the SGA, for some 
performance criteria. 

2. Use domain-specific knowledge to design a simple 
algorithm that is able to optimize that function 
better than the SGA. If no such algorithm can be 
found, or if all such algorithms incorporate unrea- 
sonable amounts of domain-specific knowledge, go 
to Step 4. 

3. Modify the function so that it is hard for the sim- 
ple algorithm yet still easy for the SGA. If no such 
function can be found, go back to Step 1 and start 
over. Otherwise, go back to Step 2. 

4. Stop — a candidate function has been found. 

Algorithm 1: A dialectical heuristic for finding a simple 
function that is easy for the SGA but hard for other op- 
timizers. (Note that this heuristic may never succeed.) 

be tracked over time [19]. Hypotheses about the effects 
of various landscape features on EA behavior can then 
be formulated and tested. This knowledge can then be 
applied to the study of real-world functions. 

Given enough domain-specific knowledge, it is plausible 
that a specialized optimization method can be designed 
to outperform the SGA on any sufficiently restricted 
class of functions. (The NFL theorem does not hold if 
the subset of functions being considered has measure 
in distribution; this is true for many subsets of inter- 
est, in particular all countable subsets [J. H. Holland, 
personal communication].) Therefore, the issue is not 
whether the SGA will outperform all other algorithms 
on a given class. Rather, it is: How much domain- 
specific knowledge is it reasonable to incorporate into an 
algorithm before it becomes over-specialized or too ex- 
pensive to design and implement, outweighing any per- 
formance advantage over the SGA? A related question 
is: How broad must a class of functions be before the 
SGA outperforms a specialized optimizer on it? More 
generally, the RR papers suggest a dialectical heuristic 
to search for a simple function that is easy for the SGA 
but hard for hillclimbers (Algorithm 1). (Note that "di- 
alectic" here simply denotes "the existence or working 
of opposing forces" [25].) While this heuristic is very 
straightforward, it has never been articulated explicitly. 
Since it is a heuristic, it may never succeed; however 
it is a plausible way to search for the desired class of 
functions. 

The remainder of this paper provides a concrete exam- 
ple of using this heuristic. The pothole function p\ is 
introduced and shown to be difficult for the RMHC but 
easy for the SGA. Then a variant of the RMHC, the 
lax random- mutation hillclimbcr (LRMHC), is defined, 
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which knows the depth of the potholes in p\ and is able 
to cross over them to reach the optimum. This hill- 
climber is shown to outperform the SGA on p\. The 
paper concludes with a discussion of the results. 

3 POTHOLE FUNCTIONS 

Following the dialectical heuristic described in Section 2, 
I modified the RR functions to make them harder for 
simple hillclimbers by adding potholes. Potholes are 
detrimental schemata that contain beneficial schemata, 
and which, in turn, arc necessary to reach beneficial 
schemata with higher fitness contributions [19]. This 
produces the class of pothole functions. All experiments 
in this paper were performed on the pothole function 
pi, which is defined in Table 1. That table lists all of 
the schemata that contribute to an individual's fitness, 
along with their fitness contributions; these schemata 
are called the function's significant schemata. 

The fitness pi(x) of a string x £ {0, l} 64 is given by 

Pl (x) =maxi 0, 100+ ^ ? > W 

where S is the set of significant schemata for p\ , s is a 
schema in S, and ju(s) is the fitness contribution of s. 
The notation x £ s stands for "the string x is an instance 
of the schema s" . Individuals have a base fitness of 100, 
so that in other pothole functions they may be less fit 
than the base fitness without having a negative fitness; 
the fitness is forced to be equal or greater than so that 
fitness-proportionate selection may be used. The global 
optimum is a string of 64 Is, which has a fitness of 115. 

The function consists of 4 levels, defined in Table 1 . The 
first level consists of elementary schemata, each of which 
is a block of 8 Is. Each higher level consists of compound 
schemata composed of schemata from the previous one. 
The elementary and compound schemata are all ben- 
eficial schemata. An algorithm is said to reach a level 
when it finds an individual that is an instance of at least 
one significant schema from that level. 

If p\ consisted only of schemata sq— Su, it would be a RR 
function, similar to R2 [19]. The additional schemata 
S15S31 are potholes. The potholes S15 and sie prevent 
a single- mutation hillclimber, such as the RMHC [7], 
that has reached the first-level schema So from reach- 
ing the second- level schema sg. This is because every 
sequence of single-bit mutations that leads from sq to 
ss would force the hillclimber to go downhill in fitness 
through one of these potholes (assuming neither of them 
is present to begin with), which it cannot do. Similarly, 
the potholes S17 and Sis prevent it from reaching ss if it 
has reached s\. The remaining potholes block the path 
to the other second-level schemata. 



4 EXPERIMENTS ON Pl 

I first compared the SGA against a variety of hill- 
climbers on pi , to verify that it was more difficult than 
the RR function R2 for hillclimbers such as RMHC. 

4.1 SIMPLE GENETIC ALGORITHM (SGA) 

The SGA used a population of 512 individuals. Two 
offspring were produced for each pair of parents, and 
the entire population was replaced in each new gener- 
ation. Standard one-point crossover was used with a 
probability of 0.7 per mating pair. Point mutation was 
applied to each offspring with a probability of 0.005 per 
allele (mutations simply flipped the allele from to 1 or 
vice- versa). Fitness-proportionate, or "roulette wheel", 
selection was used, with cr-truncation scaling [9]: 

/' = max{min[/ - (/- ca), 1.5], 0}. (2) 

Here /' is an individual's scaled fitness, / is its unsealed 
fitness, / is the population average unsealed fitness, a 
is the standard deviation of unsealed fitness in the pop- 
ulation, and the scaling constant c = 2. The maximum 
and minimum possible scaled fitnesses are 1.5 and 0, re- 
spectively. The scaled fitnesses were then used to select 
the parents of the next generation. If a < 0.0001, the 
unsealed fitnesses were used instead. Scaling appears to 
be necessary for the SGA to do well on these functions. 

These parameters were chosen rather arbitrarily, since 
the goal is to find a class of functions that the SGA 
can optimize easily, without being sensitive to the exact 
parameter settings. 

4.2 HILLCLIMBERS 

When comparing the SGA with hillclimbers, it is im- 
portant to report results from a variety of hillclimbers. 
In these experiments, I used the next-ascent hill- 
climber (NAHC), steepest-ascent hillclimber (SAHC), 
and random-mutation hillclimber (RMHC) described 
by Forrest and Mitchell [7], and Jones's crossover hill- 
climber (XOHC) [16]. 

The XOHC used differs from Jones's in that it repeats 
if the maximum number of jumps is reached, until the 
maximum number of evaluations has been performed. 
Jones's original algorithm also quit if no fitness increase 
was found within 10000 steps; the one used here does 
not. 

4.3 PERFORMANCE CRITERIA 

In order for one algorithm to outperform another in this 
study, it should do so over a wide range of reasonable 
performance metrics. I use the number of function eval- 
uations needed to reach the optimum as the primary 
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Table 1: The pothole function p\. An individual's fitness is calculated by summing the fitness contributions of the 
schemata of which the individual is an instance, and then adding this sum to a base fitness of 100. If the result is 
less than 0, it is reset to 0. The global optimum is a string of 64 Is, with a net fitness of 115. 
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performance metric, under the assumption that func- 
tion evaluation dominates an optimizer's running time. 
When one algorithm reaches the optimum more often 
than the other, I use the number of times the optimum 
is reached as the primary performance metric. When 
neither algorithm reaches the optimum, I use the num- 
ber of evaluations needed to reach each level, as well as 
the number of times each level was reached. The max- 
imum fitness reached in each run is also recorded, but 
not shown here. 

Fitness timeseries were also plotted for each algorithm, 
sampled every 512 function evaluations and averaged 
over the set of runs; only those for the SGA and two of 
the hillclimbcrs are shown here. (Individual runs were 
also plotted but are not shown here.) These provide 
much more information about the course of each run, 
including the rate of improvement in fitness. For the 
SGA, the population best and average unsealed fitness 
are plotted; for the hillclimbers the fitness of the best 
individual evaluated so far is plotted, along with the 
fitness of the current individual being evaluated. 



4.4 EXPERIMENTAL RESULTS 

The SGA, NAHC, SAHC, RMHC, and XOHC were each 
run 50 times on p\ , for 256000 function evaluations per 
run. The results are presented in Table 2. Timeseries 
for the SGA and RMHC are presented in Figures 1-2. 
The function pi achieved the goal of being hard for the 
RMHC. Neither it nor any of the other hillclimbcrs ever 
found the optimum. Among these algorithms, only the 
SGA ever found the optimum (level 4), and it did so 
in almost every run. The timeseries for the SGA and 
the RMHC are consistent with the other performance 
metrics. 

5 LAX RANDOM-MUTATION 
HILLCLIMBER (LRMHC) 

Following the dialectical heuristic presented in Section 2, 
the next step was to see how hard it was to design a 
hillclimbcr that outperformed the SGA on p\. Since the 
fitness penalty of each pothole was 0.1, and the fitness 
contribution of each beneficial schema was either 1.0 or 
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Figure 1: SGA population best and average fitness on 
Pi , sampled every generation and averaged over 50 runs. 
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Table 2: The SGA, RMHC, NAHC, SAHC, XOHC, and 
LRMHC on pi, 50 runs, 256000 function evaluations: 
Mean evaluations needed to reach each level (level 4 is 
the optimum) . Here n is the number of times a level was 
reached, x is the sample mean number of evaluations 
needed to reach each level, averaged over n, and s is the 
sample standard deviation of the number of evaluations. 





Level 

12 3 4 


SGA n 

X 

s 


50 50 50 48 
36.9 4026.9 17838.2 65063.2 
32.6 1833.5 13546.4 50361.2 


NAHC n 

X 

s 


50 48 
224.6 72733.2 
219.1 62128.0 


SAHC n 

X, 

s 


50 48 
213.3 71687.3 
183.1 57099.5 


RMHC n 

X 

s 


50 2 
333.7 3133.5 
289.6 1051.5 


XOHC n 

X 

s 


50 50 34 
392.0 10454.8 110519.0 
453.8 8675.6 63414.0 


LRMHC n 
(e = 0.1) x 
s 


50 50 50 50 
249.9 1342.5 3547.1 6244.0 
229.0 915.0 2025.4 3055.2 



Figure 2: RMHC best and current fitness on pi, sam- 
pled every 512 function evaluations and averaged over 
50 runs. 
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Figure 3: LRMHC best and current fitness on p%, sam- 
pled every 512 function evaluations and averaged over 
50 runs, (e = 0.1). 



1.4, there is a foolproof method for determining whether 
a drop in fitness resulted from encountering a pothole, 
or from something else. If the drop is 0.1, then it is due 
to just a pothole and can be ignored. If it greater than 
or equal to 0.8, one or more beneficial schemata have 
been lost. 

The lax random-mutation hillclimber (LRMHC) listed 
in Algorithm 2 resulted from incorporating this domain- 
specific knowledge into the RMHC. The LRMHC is ex- 
actly like the RMHC, except that it accepts any new 
string whose fitness is no more than e below the fit- 
ness of the current string; in these experiments, e is 0.1. 
(On pi, e can be any value in the interval [0.1,0.8).) 
The algorithm is very similar to the constant threshold 
algorithm (CTA) developed independently by Quick et 
al. [23]. The only difference is that the CTA accepts a 
new string only if its fitness is strictly greater than the 
old string's fitness minus e, rather than greater than or 
equal as in the LRMHC. (Due to a typo, the LRMHC 
algorithm published by Holland [13] also differs in this 
way from the algorithm listed here.) In turn, both al- 
gorithms are similar to the record-to-record travel algo- 
rithm and the great deluge algorithm [3] , and to thresh- 
old accepting [4]. 
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1. Initialize the current individual to a random string. 

2. Mutate one randomly-chosen allele. If the new 
string has a fitness equal to or greater than the cur- 
rent individual's fitness minus e, replace the current 
individual with the new individual. 

3. If the number of fitness evaluations performed so far 
is less than the maximum, go to Step 2. Otherwise, 
stop. 

Algorithm 2: The lax random- mutation hillclimbcr 
(LRMHC) algorithm, e is set to 0.1 in this paper. 

In effect, the LRMHC assumes that any function it 
encounters has potholes that all have a depth of no 
more than e, and that it can ignore them since build- 
ing blocks have an observed fitness contribution higher 
than this value. It might seem unreasonable to incorpo- 
rate this knowledge into the LRMHC. But the RMHC 
can be viewed as incorporating just as much knowledge; 
it merely assumes that the pothole depth is always 0. 
The issue here is not whether the LRMHC is a useful 
general-purpose optimizer for real functions. Rather, it 
is: How much domain-specific knowledge must be built 
into a hillclimber so that it outperforms the SGA on p\l 

5.1 LRMHC EXPERIMENTAL RESULTS 

Results for LRMHC on p\ are shown in Table 2 and 
Figure 2. It outperforms the SGA and all of the other 
algorithms on p\ , always finding the optimum, and find- 
ing it much faster than the SGA. The timeseries for 
LRMHC also shows a rapid increase in fitness over the 
SGA and RMHC. These results demonstrate that there 
is a very simple algorithm that outperforms the SGA 
by a wide margin on p\. Similarly, Quick et al. [23] 
showed that the CTA outperformed a GA variant on a 
class of RR functions proposed by Holland and described 
by Jones [15], which also contained potholes. (However, 
the SGA outperforms this class of hillclimber on the RR 
function RA [20].) 

6 CONCLUSIONS 

This paper has presented a research program to investi- 
gate the validity of the BBH, as well as some preliminary 
results. A dialectical heuristic for finding a simple class 
of functions on which the SGA outperformed other sim- 
ple search algorithms was presented. A class of pothole 
functions was designed by adding potholes to the RR 
functions, in order to make them harder for simple hill- 
climbers. The pothole function p\ was shown to be hard 
for hillclimbers such as the RMHC but easy for the SGA. 
Then a new hillclimber, the LRMHC, was designed by 



incorporating domain-specific knowledge about p\ into 
the RMHC. While LRMHC is not useful as a general- 
purpose optimizer, this simple hillclimber outperformed 
the SGA onp 1; demonstrating that simple pothole func- 
tions such as pi are still too easy for hillclimbers. This 
result does not by itself invalidate the BBH. However, it 
reinforces the finding of the RR papers that simple as- 
sumptions about what functions are especially easy for 
the SGA, relative to other optimizers, are often unjus- 
tified. Simple hillclimbers can be surprisingly effective 
at optimizing simple functions. 

The next step, following the dialectical heuristic from 
Section 2, is to modify p\ so that it becomes hard for 
the LRMHC, while remaining easy for the SGA. First, 
however, the LRMHC's behavior on p\ must be investi- 
gated, in order to predict what kinds of functions it will 
find difficult. 
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